Organising a photograph collection based on human appearance

نویسنده

  • Bartlomiej Uscilowski
چکیده

This thesis describes a complete framework for organising digital photographs in an unsupervised manner, based on the appearance of people captured in the photographs. Organising a collection of photographs manually, especially providing the identities of people captured in photographs, is a time consuming task. Unsupervised grouping of images containing similar persons makes annotating names easier (as a group of images can be named at once) and enables quick search based on query by example. The full process of unsupervised clustering is discussed in this thesis. Methods for locating facial components are discussed and a technique based on colour image segmentation is proposed and tested. Additionally a method based on the Principal Component Analysis template is tested, too. These provide eye locations required for acquiring a normalised facial image. This image is then preprocessed by a histogram equalisation and feathering, and the features of MPEG-7 face recognition descriptor are extracted. A distance measure proposed in the MPEG-7 standard is used as a similarity measure. Three approaches to grouping that use only face recognition features for clustering are analysed. These are modified k-means, single-link and a method based on a nearest neighbour classifier. The nearest neighbour-based technique is chosen for further experiments with fusing information from several sources. These sources are context-based such as events (party, trip, holidays), the ownership of photographs, and content-based such as information about the colour and texture of the bodies of humans appearing in photographs. Two techniques are proposed for fusing event and ownership (user) information with the face recognition features: a Transferable Belief Model (TBM) and three level clustering. The three level clustering is carried out at “event” level, “user” level and “collection” level. The latter technique proves to be most efficient. For combining body information with the face recognition features, three probabilistic fusion methods are tested. These are the average sum, the generalised product and the maximum rule. Combinations are tested within events and within user collections. This work concludes with a brief discussion on extraction of key images for a representation of each cluster.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Dialectic of Truth and Appearance in Lacan’s Reading of Gerhard Richter’s Overpainted-from-photographs

One of the most important contemporary theoretical approaches that can be used to analyze works of art is the theories of Jacques Lacan (1901-1980), a French post-structuralist psychoanalyst. By combining psychoanalysis, philosophy, linguistics, and anthropology, he analyzes the world of the human subject’s mind in a series of intertwined and extensive cultural and social relationships.This art...

متن کامل

تحلیل عکس با گونه‌های زبانی

With regard to representative characteristic and communicative nature of photograph, and on the other hand, expressive and esthetic capabilities of photography, it can be stated that as human being uses language in various ways and thus creates one of the linguistic types, i.e. prose, verse, and poem, the photographer is also able to create photos with functions and qualities similar to linguis...

متن کامل

Analysis of “Gol -o- Norooz” by Khwaju Kermani According to the Jungian Archetype of Individuation

Carl Gustav Jung, the   founder   of   the   analytical   psychology   in the twentieth century   believes    that   under   the   appearance   of   human   consciousness   exists   an eternal collective unconscious   which is   part   of   the   hereditary   psychological   factor   common in the entire human race. He successfully introduced   the common archetypes in the mythology of   the di...

متن کامل

Self-Organising Maps in Document Classification: A Comparison with Six Machine Learning Methods

This paper focuses on the use of self-organising maps, also known as Kohonen maps, for the classification task of text documents. The aim is to effectively and automatically classify documents to separate classes based on their topics. The classification with self-organising map was tested with three data sets and the results were then compared to those of six well known baseline methods: k-mea...

متن کامل

Colour Appearance Descriptors for Image Browsing and Retrieval

In this paper, we focus on the development of whole-scene colour appearance descriptors for classification to be used in browsing applications. The descriptors can classify a whole-scene image into various categories of semantically-based colour appearance. Colour appearance is an important feature and has been extensively used in image-analysis, retrieval and classification. By using pre-exist...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007